CDS

Accession Number TCMCG075C07047
gbkey CDS
Protein Id XP_007044548.2
Location join(31158700..31158702,31158840..31158905,31159002..31159139,31159337..31160229,31160351..31160421,31160761..31160863,31161028..31161103,31161760..31161867)
Gene LOC18609397
GeneID 18609397
Organism Theobroma cacao

Protein

Length 485aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_007044486.2
Definition PREDICTED: U4/U6 small nuclear ribonucleoprotein Prp31 [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category A
Description U4 U6 small nuclear ribonucleoprotein
KEGG_TC -
KEGG_Module M00354        [VIEW IN KEGG]
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko00002        [VIEW IN KEGG]
ko03041        [VIEW IN KEGG]
KEGG_ko ko:K12844        [VIEW IN KEGG]
EC -
KEGG_Pathway ko03040        [VIEW IN KEGG]
map03040        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGGCAACTCTTGCTGATTCATTTCTTGCGGACCTTGATGAATTATCGGACAACGAAGCCGATGTTCTCGAGGAAGAAAATGATGATGTCACCAACATGGAAGAAGATGTTGATGGTGACCTGGCCGATATTGAAGCCCTTAACTACGATGAGCTGGACAGCGTTTCGAAATTGCAAAAAACTCAGAGATACATTGATATAATGCAGAAAGTGGAAGATGCACTCGAGAAGGGTTCTGATATATCAAATCAGGGAATGGTATTGGAAGATGATCCTGAGTATCAGCTGATTGTGGACTGTAATTCACTATCGGTTGACATTGAGAATGAGATTGTTATTATCCATAATTTTATACGTGATAAGTACCGGTTGAAGTTTCCTGAGCTTGAATCACTTGTACATCATCCGATTGATTATGCTCGTGTAGTGAAAAAGATTGGCAATGAGATGGATTTAACCCTGGTTGATTTGGAAGGACTTTTGCCTTCGGCTATCATTATGGTTGTTTCAGTTACAGCATCGACTACTAGTGGCAAGCCACTTCCAGAAGATGTTCTTCAAAAAACTATTGATGCATGTGATCGTGCTCTTGCTTTAGACATGGCAAAGAAAAAGGTTCTTGATTTTGTAGAAAGTAGAATGGGATATATTGCACCAAATCTTTCTACTATTGTTGGGAGTGCTGTTGCTGCTAAACTTATGGGTACTGCTGGTGGTCTTTCAGCGTTAGCTAAGATGCCTGCTTGTAACGTTCAGCTGCTTGGTGCAAAGAAAAAGACCCTTGCAGGGTTTTCTACTGCAACGTCACAATTTCGTGTTGGTTATATTGAACAGACAGAGATTTTTCAATCTACACCCCCGGCTTTGAGAAGTCGTGCTTGCCGTCTCTTGGCTTCAAAAGCAACACTTGCGGCACGAATAGATTCTACTCGAGGGGATCCATCAGGGAATGCTGGAAGAACTCTGAAGGATGAGATCCATAAGAAAATTGAAAAGTGGCAAGAGCCACCTCCCGCAAAGCAGCCTAAACCCCTTCCTGTTCCTGATTCTGAACCTAAGAAAAAGAGAGGTGGCCGTCGCTTAAGGAAGATGAAGGAGAGGTATGCTATAACAGACATGAGGAAACTGGCAAACAGGATGCAATTTGGTGTACCTGAGGAGAGCTCCTTAGGTGATGGACTCGGTGAAGGCTATGGAATGCTTGGTCAGGCTGGGAGTGGAAAACTGCGTGTATCAGTTGGTCAGAGCAAACTTGCGGCAAAAGTTGCTAAGAAGTTCAAGGAAAAGCATGGCGGAAGCAGTGGTGCTACCTCTGGACTGACTTCAAGTTTGGCATTCACACCTGTGCAGGGGATTGAGCTCACAAACCCTCAAGCTCATGCACATCAGCTTGGCAGTGGAACTCAAAGTACTTATTTCTCTGAGACTGGAACCTTTTCGAAGATCAAAAGGACATGA
Protein:  
MATLADSFLADLDELSDNEADVLEEENDDVTNMEEDVDGDLADIEALNYDELDSVSKLQKTQRYIDIMQKVEDALEKGSDISNQGMVLEDDPEYQLIVDCNSLSVDIENEIVIIHNFIRDKYRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEGLLPSAIIMVVSVTASTTSGKPLPEDVLQKTIDACDRALALDMAKKKVLDFVESRMGYIAPNLSTIVGSAVAAKLMGTAGGLSALAKMPACNVQLLGAKKKTLAGFSTATSQFRVGYIEQTEIFQSTPPALRSRACRLLASKATLAARIDSTRGDPSGNAGRTLKDEIHKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRRLRKMKERYAITDMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVSVGQSKLAAKVAKKFKEKHGGSSGATSGLTSSLAFTPVQGIELTNPQAHAHQLGSGTQSTYFSETGTFSKIKRT